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We discuss concepts of message identification in the sense of Ahlswede and Dueck 
via general quantum channels, extending investigations for classical channels, ini- 
tial work for classical-quantum (cq) channels and "quantum fingerprinting". We 
show that the identification capacity of a discrete memoryless quantum channel for 
classical information can be larger than that for transmission; this is in contrast 
to all previously considered models, where it turns out to equal the common ran- 
domness capacity (equals transmission capacity in our case): in particular, for a 
noiseless qubit, we show the identification capacity to be 2, while transmission and 
common randomness capacity are 1. Then we turn to a natural concept of iden- 
tification of quantum messages (i.e. a notion of "fingerprint" for quantum states). 
This is much closer to quantum information transmission than its classical counter- 
part (for one thing, the code length grows only exponentially, compared to double 
exponentially for classical identification). Indeed, we show how the problem ex- 
hibits a nice connection to visible quantum coding. Astonishingly, for the noiseless 
qubit channel this capacity turns out to be 2: in other words, one can compress two 
qubits into one and this is optimal. In general however, we conjecture quantum 
identification capacity to be different from classical identification capacity. 



1 Introduction 

The theory of identification of messages via (noisy) channels was initiated by 
Ahlswede and DueckpEand has resulted in interesting developments: it has a 
strong connection to the theory of common rando mness F 1 and it has triggered 
the theory of "approximation of output statistics" ,| 16 | 34 | p u ^ briefly, whereas 
in Shannon's famed theory of communicatiorfSl the object is to transmit a 
message over a channel, described by a stochastic map W : X — ► y, by 
block coding of length n, for identification the receiver should only be able 
to answer questions "Is the message i sent equal to one j that I have in 
mind?" . It turns out that this achieves much larger codes than transmission: 
while the latter has an exponentially growing (in n) optimal message set, the 
former allows for dou ble exponential growth. Rabin and YaJ^ — see also 
Kushilevitz and Nisanp^l Example 3.6 and the Bibliographic Notes — have 
shown that the randomised communication complexity of the equality function 
of Ji-bit strings is O(logn) bits. The achievement of Ahlswede and DueclJ^ 
was to determine the constant in the exponent: it is the Shannon capacity 
of the channel — but can be larger in models with feedback^ However, in 
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this paper we will only consider discrete memoryless (classical and quantum) 
channels with no feedback or other helpers, and with an i.i.d. time structure. 
Formally, consider a discrete memoryless quantum channeff^ 



T : Ax 



A 2 , 



modelled as a completely positive, trace preserving map between C*-algebras 
Ai , A2 which in this paper arc always finite dimensional. It is well-known that 
a finite C*-algebra A is isomorphic to a direct sum of full matrix algebras, and 
we can think of it as the subalgebra of the operator algebra B(TL) on a finite 
dimensional Hilbert space Tt, of operators commuting with some selfadjoint 
operator A: 



with the eigenspaces Hi of A. The states on A we identify with the semidef- 
inite operators in A of trace 1, the set of which we denote S(A). Following 
HolevcW we call T a cq-channel if Ai is commutative (=classical), and a 
qc-channel if Ai is commutative: in the first case the action of T is deter- 
mined by its images W x — T(x) on the (finitely many) minimal idempotents 
x, in the second case this applies to the adjoint map T* : A2 — ► Ai, and 
the images M y = T* (y) of the minimal idempotents y form a positive opera- 
tor valued measure (POVM). The channel is classical if both Ai and Ai are 
commutative. 

We shall consider block coding for n copies of T: an (n, X)-code for this 
channel is a collection {(7Tj, Di) : i = 1, . . . , M} of (without loss of generality: 
pure) states 7Tj on Af n , and positive operators Di £ Af n which sum to 1 
(i.e., a POVM), such that 



The maximum such M will be denoted M(n, A). If all the TTi are product 
states with respect to Af n , the code is called separable, and the corresponding 
maximal M is denoted Mx(n, A). The knowledge about these quantities is 
summarised in the HSW-theorem (H(p) = — Trplogp is the von Neumann 
entropy) : 

Theorem 1 (Holevo| l^ 2 Q I 21 j Sc humacher, Westmoreland, 32 

Ogawa, Nagaoka, 29 WinteiESEHl) For all < A < 1, one has the 

coding theorem and strong converse, 
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A = {X : XA = AX} = B(Hi), 



V?: Tr(T® n (7ri)A) > 1 — A. 



(1) 



lim — log Mi (n, A) = x(T) 



with the Holevo capacity of the channel 



x(T) — max < H 
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For general codings, the capacity and the weak converse are given by 
C(T) = inf liminf - logMfn, A) 

A>0 n— *oo n 

= inf lim sup — log M (n, A) = lim -x(T® n ). 

(Here and elsewhere in the paper log and exp are to basis 2.) □ 
It is widely conjectured that x 1S additive with respect to tensor products, 
which would imply C(T) = x(T). Furthermore, it is conjectured that the 
strong converse also holds for M(n, A) (which is known for cq-channels since 
there M(n, A) = Mi(n, A)). 

Following LobeJ^and generalising the classical definition*" an (n, X%, A2)- 
ID code for T is a collection of pairs {(pi,Di) : i = 1, . . . , N} of states ^ on 
Af n and operators < A < 1 in Af n , such that 

Vz Tr(T®"( Pi )A) >1-A l5 

Vi^j Tr(T®"( Pj )A) < A 2 . 

The bounds Ai and A2 are called error probabilities of first and second kind, 
respectively, since the problem is a coding variant of hypothesis testing. For 
the identity channel id^4 of a system A (Kuperberg^l calls this a hybrid quan- 
tum memory), we call an ID code also a code on the algebra A® n , or more 
generally on an algebra A if the pi and Di are elements of A. 

The code is called simultaneous if the binary observables (POVMs) 
(£)j, 1 — Di) are all co-existent, in the sense of LudwigpSI i.e. if there ex- 
ists a POVM (Ek)k=i k such that for all i a set T>i c {1, • • • , K} can be 

found with 

Di = E k- ( 3 ) 

Denote the maximum N such that an (n, Ai, A2)-ID code (a simultane- 
ous (n, Ai,A 2 )-ID code) of length TV exists by N(n, Ai,A 2 ) (iV s j m (n, Ai, A 2 )). 
Clearly, iV sim (n, Ai,Aa~T< iV(n, Ai, A 2 ). 
Theorem 2 (L6beJ23) for a// Ai, A 2 > 0, 

liminf - log log N sim (n,X 1 ,X 2 ) > C{T), 

n — >oo fi 

with the Holevo (transmission) capacity C(T) of the channel. If the channel 
T satisfies the strong converse and in addition the technical condition that on 
block length n its input states may be restricted to an alphabet of size 2 2 ( ' , 
then for < Ai, A2, Ai + A2 < 1, 

lim - loglogiV sim (n, Ai, A 2 ) = C(T). 

n—>oo Ji 

□ 
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The first part is proved by concatenating a transmission code for T with the 
following construction: 

Proposition 3 (Ahlswede, Dueck^) Let M be a set of cardinality M, A > 
and e such that A log (± - l) > 2. Then there exist N > 2^ M ^/M subsets 
Mi C M of cardinality \_eM\, such that 

Vi^j \Mi n Mj\ < X[eM\ . 

In other words, the pairs {(Pi, Mi) : i = 1, ... ,N}, with the uniform distri- 
bution Pi on Mi, form an ID code with error probability of first kind 0, and 
of second kind A. □ 
The technical condition in theorem[21is true for example for cq-channels (2°( n ) 
input strings) but not for the ideal qubit channel idc= : to approximate every 
input state on n qubits requires 2 Z pure states. It is not known if the 

condition is necessary for the conclusion of the theorem but our construction 
following below in section certainly violates it. 

Regarding the necessity of the simultaneity condition, we have 
Theorem 4 (Ahlswede, Winter**!) F° r a cq-channel T and < Ai,A2, 
Ai + A 2 < 1, 

lim -loglogiV(n ) Ai,A2)=C(r)=x(T). 

The same holds for general channels and codes of separable states, as this 
restricts us effectively to a cq-channel. □ 

This left open, however, the understanding of the precise role of the si- 
multaneity condition in general. The present work will push this question 
further a bit: we determine the capacity to be 2 for the noiseless qubit chan- 
nel (theorem EJ, and give a formula for general hybrid quantum memories 
( corollary 1 13(1 . based on a new construction to extend an identification code. 
We furthermore conjecture the capacity to be different for simultaneous iden- 
tification: specifically, that it is 1 for the noiseless qubit. 

While all this is concerned with identification of classical messages, the 
second part of the paper (section |3J) will study concepts of quantum message 
identification. These can be related to visible codings and decodings for quan- 
tum channels. Roughly, quantum message identification aims at encoding the 
pure states \<p) of a Hilbert space, such that a quantum mechanical test of "Is it 
|(9)?" may performed on the output: a binary measurement with distribution 
close to (|(6>|</>)| 2 , 1 — |(#|</>)| 2 ). I.e., the fidelity test can be simulated, which 
has been argued to be the quantum analogue of the identity predicate!^ Our 
findings are in sharp contrast to the case of classical message identification: 
we do not find the doubly exponential growth of code length (here: dimen- 
sion) with the block length, but only exponential growth, like in transmission. 
Still, we find a capacity different from the quantum transmission capacity: it 
is 2 for the noiseless qubit channel ('theorem 119(1 . We conclude with a discus- 
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sion of our results which puts them in their right context, and highlight open 
questions and conjectures. 



2 Classical message identification 

The results of LobeJ^I and of Ahlswede and the authoJ^ seem to indicate 
that the (simultaneous and non-simultaneous) identification capacity of a 
memoryless quantum channel is equal to its transmission capacity. 

We now show that for a general channel — in fact we may take a noise- 
less qubit — the (non-simultaneous) identification capacity can exceed the 
transmission capacity. We begin with a result abou t pure state ID codes: 
Proposition 5 ("Quantum fingerprinting'*!^!) OnB(C d ) there exists an 
ID code {(ipi,ipi) : i = 1, . . . , N} of N > pure states fa = \ipi){^i\, 

with error of first kind and of second kind X, with A log (- — l) > 4. 

Proof. We could simply take the construction of Buhrman et a/.p^but we 
prefer to reduce it to proposition [3] 

Take the probability distributions Pj on {1, ...,d} of an ID code 
{(Pi, -M))} such as proposition (with error probability of second kind A/2) 

and define the states \ipi) — Ylk=i V-P(k)\k). Note that defining Di = \ipi)(ipi\ 
will make the error probability of first kind equal to 0. For the error of second 
kind, note that it is (for i =/= j) 



Using well-known relations between distinguishability measures of distribu- 
tions (see e.g. Fuchs and van de Graai^l) this is upper bounded by A. □ 

Using mixed states, one can increase N dramatically: 

Proposition 6 For every < A < 1 and e > such that A log (- — l) > 8, 

there exists on the quantum system, B(C d ) an ID code with 

messages, with error probability of first kind equal to and error probability 
of second kind bounded by A. The constant K(X) may be chosen 

K(X)= (A/10 ° )4 



41og(100/A)' 

Proof. We concatenate quantum fingerprinting, proposition [S] with the main 
result of the next section, proposition El according to it one can encode the 
pure states on C s , S — \_K(X)d 2 \, into B(C d ) such that the measurements 
(tt, 1 — 7t) can be implemented with accuracy A/2 in the individual output 
probabilities. We simply encode the states from a pure state ID code with 



5 



error of second kind A/2. Looking at the proof of proposition El we see that 
we can assume that the fingerprinting states are part of the net of states (see 
the following lemma) on which the error of first kind does not increase — it 
stays 0. □ 

Using the following lemma we can show that the cardinalities of the codes in 
propositions [5] and are asymptotically optimal. 

Lemma 7 (Bennett et al.,^ lemma 4) For e > 0, there is a set A4 of 
pure states of pure states in d -dimensional Hilbert space with \M\ < (5/e) 2d , 
such that for all pure states \tp) there is \(p) G M with n\(p)(ip\ — |y?}(<^| L < e. 
(We call such a set an e-net .) □ 
Proposition 8 Let {(/?,-, Dj) : i = 1,...,N} be an ID code on B(C d ) with 
error probabilities A 1; A2 of first and second kind, respectively, with X1+X2 < 1. 
Then, 



if all pi are pure, N < 
for general pi, N < 



1 — Ai — A 



2<l 



2 

x 2d 2 
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1 — Ai — A 



Proof. The key insight is that for i ^ j, -^WPi ~~ Pj\\i > 1 ^ Ai — A2, because 
we have an ID code. 

In the first case of pure pi, fix an e-net M. in the pure states, with 
e < 1 — Ai — A2, according to lemma This net decomposes the set of pure 
states into (Voronoi) cells of radius < e, so no two pi can be in the same cell. 
Hence N < (-) , and as e was arbitrary we obtain the first upper bound. 

For the general case pick an e-net in C d (g> C d , according to lemma [7| 
Since every state on C d has a purification on C d ® C d and the trace distance 
is monotonic under partial traces, we obtain an e-net in S(C d ) of cardinality 

(|) 2d . From here we argue as in the pure state case. □ 

We can now determine the identification capacity of the noiseless qubit: 
Theorem 9 For the ideal qubit channel idea and < Ai, A2, Ai + A2 < 1, 

lim - loglogiV(n, A x , A 2 ) = Ci D (id C 2) = 2. 

n — >oo fi 

Proof. Immediate by putting together propositions El and [HI for d = 2™. □ 

This shows that for a general channel T, the (classical message) identi- 
fication capacity is at least the double of its quantum transmission capacity 
Q(T) (i.e., the capacity of the channel to transmit quantum informatio n ' j) . 
Of course, by the general construction from proposition [3] it is also at least 
as large as its classical capacity C(T). In fact, we can show a bit more, with 
the he lp o f the following generalisation of the constructions of Ahlswede and 
Ducck 
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Proposition 10 Let {(pi, Di) : i = 1, . . . , N} be an ID code on A with error 
probabilities Ai, A2 of first and second kind, respectively, and letC be a classical 
system (commutative algebra) of dimension M. Then, for e > 0, there exists 
an ID code {(cr/,-D/) : / = 1, ...,N'} on A ® C with error probabilities 

Ax, A2 + e of first and second kind, respectively, and N' > (-i-/V c ) M . 
Proof. Denote the minimal idempotents of C as [k], k = 1, . . . , M. The new 
code elements and decoding operators will be constructed iteratively via (ran- 
dom) functions / : {1, . . . , M} — > {1, . . . , N}: 

fc fc 

Clearly, the error probability of first type is Ai. Let a maximal code of this 
sort, with error probability of second kind A2 + e, be already constructed, 
with functions fx, . . . , /at' as above. Pick the new function / randomly, i.e. all 
values /(fc) are uniform on {1, ... , N} and independent, and construct c/ and 
Df according to our prescription. 

It is easy to check that the error probability of second kind is bounded as 



Tr(a f D fj ),Tr(a fj D f ) < A 2 + ^\{k : /(fc) = fj(k)}\. (4) 

Fixing j for the moment, we introduce the Bernoulli variables with Xk = 1 
if /(fc) = fj(k) and otherwise. These are evidently independent and all have 
expectation 1/7V. By Sanov's theorerrP^ (with the binary relative entropy 
D(p|| g )=plogf + (l-p)logiEf), 

Pr|^^X fc >e| < exp(-M D(e\\l/N)) < exp(-M(elogiV - 1)). 

With the union bound and using eq. I@J), we can bound the probability that 
one of the error probabilities of second kind exceeds A2 + e, by 

iV'exp(-M(elogiV- 1)); 

if this is less than 1 we can hence enlarge our code, contradicting the maxi- 
mality assumption. This implies the lower bound on N'. □ 

Remark 11 We can make a connection to the second construction of 
Ahlswede and DuecM based on common randomness, observing that the dis- 
tribution on {1,...,M} is always uniform; in other words, the code can be 
used with equal effect if the uniformly distributed k 6 {1, . . . , M} is known 
both to the sender and the receiver. Ahlswede and Dueck showed how to build 
an ID code of rate R, using negligible communication, from rate R of common 
randomness; proposition MIA shows more generally how to increase the rate of 
any given ID code by R. 
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This construction (see also how we use it in the proof of theorem \l%\ below) 
shows in generality that any additional classical capacity besides an ID code 
increases the identification capacity; hence, an optimal ID code for a channel 
does not allow for any remaining transmission rate. 

In this connection we can ask the interesting question if the analogue to 
theorem^ of the above-mentioned result by Ahlswede and DuecEl holds with 
prior shared entanglement: do nE ebits and negligible communication yield 
an identification rate of 2E ? 

Theorem 12 If the channel T simultaneously transmits quantum informa- 
tion at rate Q and classical information at rate R, then Cm(T) > 2Q + R. 

The joint qu antu m-classical capacity region was recently determined by 
Devetak and ShoJM giving a lower bound of 

C m (T)> lim - mzx{2UA)BX) +I(X;B)\, 

n^oo n <j XAB 

where the maximisation is over all states a XAB = ^2 X p x \x)(x\ X <g) cr AB , with 
a AB — (id ® T® n )cf) x for some bipartite pure states <j) x . In the formula, 
I(X]B) = H{cr B ) — J2xPxH(c B ) is the familiar Holevo q uan tity}^ and 
I C (A)BX) = H(a XB ) - H{a XAB ) is the coherent information^ 
Proof. For sufficient block length n, the channel can transmit a Hilbert space 
of dimension d = 2 n ^~^ and simultaneously K — 2 n ( R ~ e ' messages (both 
with fidelity loss/error bounded by e). 

Using an ID code for C d according to proposition [S] and concatenating 
it with the construction of proposition EI] we obtain an (n, A, A)-ID code of 
N' > 2 2 messages, with A — > as n — > oo and e — » 0. □ 

Corollary 13 For a hybrid quantum memory A = B(C di ), 

Cfo(i<U) = (p max ) U^pilogdi + #(p) j = log \J2 d ij • 

Proof. Denoting the maximum in the theorem by T, even the strong converse 
holds: for Ai + A2 < 1, 

lim — log log N(n, Ai, A 2 ) = T. 

n — >oc Ji 

First, the lower bound follows directly from theorem !12l 

The strong converse is proved by building an e-net for S(A® n ), and then 
arguing as in proposition [HJ in detail, we first observe 

P n— type 

where the n-types are distributions on {1, . . . , r} with probability values P, G 
N/n and Ct(p) is a commutative algebra of dimension T(P) < 2 nH ( p \ the 
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number of sequences of type (i.e. empirical distribution) P. The states in this 
algebra are obtained from states on 

by dephasing different P (this is a conditional expectation), tracing out the 
first tensor factor in each product, and subjecting the third factor to a com- 
plete von Neumann measurement. All this can be described by a quantum 
channel R : B(C N ) — ► A® n , with N < (n+ l) r 2" r , since there are < (n+ l) r 
many n-types. Now we can invoke lemmaQto construct an e-net for <S(C ) 
which by virtue of R gives an e-net for S(A® n ). □ 

Remark 14 Now that we see that the identification capacity can be larger 
than the capacity for transmission, we may want to re-examine the signif- 
icance of the condition of simultaneity: it effectively replaces the quantum 
channel by a classical one, with, on block length n, a doubly exponentially 
large net of input states, and classical output which is postprocessed for iden- 
tification. 

Nevertheless, the single measurement may have the effect of likening the 
simultaneous model to transmission, and in particular we conjecture that for 
the ideal qubit channel the capacity is only X: with Ai + A2 < 1. 

C sim -iD(id C 2) = lim - log log N sim (n, A i; A 2 ) = 1. 

n^oo n 

A justification for this may come from work on channel simulations^^ it can 
be shown that any measurement on n qubits can, for all purposes of the out- 
put, be well approximated by a convex combination of measurements each of 
which has only N — 2 I1+ °( n ) many outcomes. The convex combination repre- 
sents randomisation at the output and while in general an ID code cannot be 
derandomised at the output, the quantum situation may be sufficiently special 
to allow such a conclusion. If that works, we would be done, since the output 
probabilities of a measurement of N outcomes permits a net of2°(N) points. 



3 Quantum message identification 

All the previous theory concerned identification of classical messages, though 

via quantum channels, and i s thus analogous to classical information trans- 
it! l^^l 

mission via quantum channels! ^ ■ > l But just as there is also a theory of coding 
quantum information for quantum channels (see the review by Bennett and 
, we can discuss the possibilities of quantum message identification. 
The following concepts appear rather natural: for the quantum channel 
T : Ai — ► A2 we define an (n, A) -quantum-ID code to be a pair (s,D) of 
maps, such that 

e : B{M) — > Af n (5) 
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is a quantum channel and 

D : V(M) — -» A® n (6) 

maps pure states r = to operators < D T < 1, with the properties that 
for all pure states it = \<f>){<f>\, r = \6)(0\ on B(M): 

|Tr(7rr) - Tr( ((T® n e)7r) D T ) | < A/2. (7) 

This means: the encoding is such that for the pure states tt and r a test (the 
binary POVM (D T , t — D T )) may be performed on the output signal [T® n s)ir 
which is stochastically almost equivalent to the test (r, 1 — r) on the original 
state 7r. Why should we regard this as "quantum message identification"? The 
conception that we ought to make a (highly error-free) decision if the state 
7r equals r we dismiss as unphysical: at the output we cannot expect higher 
distinguishability than at the input. For the input, however, the quantum 
mechanical version of the test "Is n equal to t or not?" has been argued*^ 
to be just (t, 1 — t). Also, restricted to a subset of mutually orthogonal 
(or almost orthogonal) states on M, i.e. for classical messages, the concept 
reduces to the classical message identification codes discussed in section 

Now define L(n, A) to be the largest dim M. such that an (n, A)-quantum- 
ID code exists. For the identity channel on some system A we adopt the 
convention to speak of codes on A, similar to the classical case; with en- and 
decoders e : B(M) -> A and D : V(M) -> A. 

The above definition, eq. (JJJ, should be compared with the usual definition 
of a quantum transmission code: there D in eq. 10 is (the restriction of) a 
completely positive, unit preserving map (the adjoint of the decoder channel 
5) 

S* : B{M) — > Af n , such that D T = S*(t). (8) 

Notice that in this case eq. (JJJ implies the familiar fidelity condition, that for 
all 7T, Tr[((<5T® n £)7r)7r] > 1 - A/2. On the other hand it is implied by 

Vtt ||(((5T®V)7r) -tt^ < A. (9) 

In this sense our concept of quantum-ID code (eqs. JSJl, iJBJ) is a "visible 
decoder" version of quantum transmission: to be precise, we shall call it ID- 
visible. A decoder of the form eq. ijSJ we call blind. Noticing that a blind 
decoder allows to perform any measurement, not only the fidelity-test, on the 
input state (S* translates POVMs into POVMs), we are motivated to also 
define (general) visible decoding as a map 

D : POVM(Al) — > POVM(^f n ) 

M = (My)y I ► D(M) = (M' y )y, 

with the error criterion that for all ir e S(A4), 
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Tr{nM v ) ~ Tv[((T® n e)*) M' y ] \ < A. (11) 
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Note that this is implied by eq. © and in turn implies eq. Q. 

All this motivates us to also distinguish blind and visible encoders as well 
(see Barnum et and Bennett et a/0 for a discussion of these concepts in 
quantum source coding and remote state preparation): blind is our definition, 
eq. © above, visible is an encoder map 

E : V{M) — » S(Af n ) (12) 

from the pure states on M into the state space of Af n . 

The type of encoder/decoder we will indicate by a two-letter code, such 
as "bv" or "vV": the first letter refers to the encoder (b: blind, v: visible), 
the second to the decoder (b: blind, V: (general) visible, v: ID-visible). Our 
original definition above is a bv-code, and this will be the default, unless 
otherwise stated. 

We begin our investigation observing that all the L(n, A)-functions exhibit 
only exponential growth: 

Proposition 15 For all < A < 1 and e,/i > 0, such that /xlog (- — l) > 4, 

1 + | log N(n, A/2, A/2 + f i) > L m (n, A) > L vV (n, A) > L vb {n, A) 

> > > 

L bv (n, A) > L bv (n,\) > L bb (n,X). 

Proof. Only the first inequality is nontrivial. For it, simply concatenate a 
given (n, A)-quantum-ID code of maximal dimension L = L vv (n,X) with a 
pure state classical ID code with error probabilities of first and second kind 
and /j,, respectively: this gives an (n, A/2, A/2 + /i)-ID code of cardinality 
2L eL J /L, and observing L vv (n, A) < N(n, A/2, A/2 + (i) the claim follows. □ 

This means that the definition of the various capacities (xy £ 
{vv, vV, vb, bv, bV, 66}) has to be 

Qxy(T) := inf liminf — log£ x „(n, A), 

A>0 n^oo n 

like ordinary transmission capacity Q(T). Qir>(T) will denote the default 
Qbv(T). Note that we adopt the "pessimistic" notion of capacity: achievable 
rates have to be met for every sufficiently large block length n. It is conceivable 
that the "optimistic" definition, where the rate has to be met only infinitely 
often (limsup in the above formula) is larger, not to mention the question of 
the strong converse (allowing fixed positive A). 

Remark 16 For every T , Qbb(T) = Q(T), the transmission capacity. This 
identity extends to Q vb in some cases: <5„b(idc2) = 1 = Q(idc2) and for 
entanglement-breaking T, Q vb (T) — = Q(T). Both statements are implied 
by a result in Bennett et al.'S Theorem 24 (Appendix C), which states that 
to transmit a state of n qubits visibly, using n — c qubits requires Q(2 n ) clas- 
sical bits: in the first case there is no classical channel available, so c = 0; 
in the second case the channel consists of a measurement followed by a state 
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preparation}*** so it cannot be better than a classical channel of bounded ca- 
pacity. This even gives the strong converse for these capacities (X < 1). 

For every T, Q VV (T) < Ciu{T): simply concatenate a vv-quantum-ID 
code with the construction of proposition^ 

For every T, Q v y(T) < C(T): for the states sent through the channels 
once could take an orthogonal basis of M., and as the POVM the correspond- 
ing von Neumann measurement, which yields an (n, A) -code for T. If T is 
entanglement-breaking, then Qbv(T) — Q v v(T) — 0: as above the channel 
is not better than a classical channel of finite capacity, but for this we can 
invoke a result of Ambainis et al.P which implies that to classically simu- 
late the statistics of arbitrary POVMs on arbitrary states of n qubits requires 
transmission of 2 n (") bits. 

The following result is the centre-piece of this paper: 
Proposition 17 For < A < 1, there exists on B(<C d ) a quantum-ID code 

of error A and dimM = S = d 2 4 ^(ioo/a) j • 

The proof uses the following slight modification of a result in Bennett et al., 
Lemma 3 (Appendices A and B): 

Lemma 18 Let tjj be a pure state, P a projector of rank (at most) r and let 
U € U(d) be a random variable, distributed according to the Haar measure. 
Then for e > 0, 



Pr ^Tt(U^U*P) > (1 + e)^} < exp 



e - ln(l 
-r- 



ln2 



Proof. From Bennett et ai.j^ proof of Lemma 3, we take that the probability 
in question is bounded by exp (— r^A^l + e)), with the rate function (see 
Dembo and Zeitounff^ f or definitions) A* of the square of a real Gaussian 
distributed random variable of mean and variance 1 . This is calculated in 
Bennett et al$ Lemma 23 (Appendix A), to 

»*/ x \h(x—l — lnx) : x > 0, 
A*(x) = < 2 

I oo : x < 0, 

which gives the claim. □ 

Proof of proposition [Pl\ Pick an ?y-net, with r/ = A/8, in , P(C' S ) of cardinality 

l^j) , according to lemma [7| The encoder will be an isometry (with a < d 

and S < ad) V : C s — > C d <g> C a , followed by the partial trace over the 
second system C a : e(ir) = Tic a (VttV*) , while the decoder is simply given by 
D r = suppe(r), the support projector onto the image of r, the nearest state 
to t in our ?y-net. We will fix S and a later. 

We shall pick the isometry V randomly (uniformly, i.e. according to the 
invariant measure), and show that with high probability it will yield a code 
of error ry, at least for states from the ?y-net. Then, for arbitrary states 7t,t 
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and their nearest neighbours w, r in the net, 

|Tr7TT — Tre(7r)D T | < |Tr7r(T — r)| + |Tr(7r — 7r)r| 

+ |Tr(e(7r) -e(7r))£> T | + |Trirf- Tre(5f)L> ¥ | 
< 477 = A/2, 

using triangle inequality, D T = and the nonincrease of the trace norm 
under the partial trace. 

So, fix an ordered pair (t = \8)(8\,tt — \4>)(4>\) of states from the ?7-net, 
and write \<f)) = y/a\9) + \fl — a^ 1 -), with a = \ {9\<j))\ 2 . Then we can write, 
with independent random unit vectors \v), \w), 

V\6) = \v), 
V\9 X ) = — ^ ~ ( v \ w )\ v ) 



) - (v\w)\v}\\ 2 

(Note that the denominator vanishes with probability 0.) Now holding \v) 
constant for the moment, so that D T = suppTr a |u)(u| is a constant, we have 

Tr(e( 7 r) J D T ) = (<p\V*(D T $ t)V\<p) 

= ((V5-^^)<«| + ^h) 



\jOL — vi — . 



ik o ((,--vr^^>> + ^K 

1 - a 

(w\D T ® 



f 2 



+ - VI -a^-j-^j — t (v\w) 

( 1— n (v\w) \ \J\ — a , . , 

+ (yEt - Vl-a^J — t (w\v) 

= a + — — [(w\D T ® - (to|vX«|to)] 

where we have denoted t := \\\w) — (v\w)\v)\\2 and used that \v)(v\ < D T ® 1. 
Hence, if 

i 

(w\v)(v\w) < (w\D T ® t\w) < e := (f]/2) 2 , (13) 

we can conclude th at |T re(7r)D T — a < 77. It remains to bound the probability 
of the event in eq. I|13[) : according to lemma IT81 putting a = [ed/2\, we have 

, 1 - In 2 N 



Pr{(w\D T ® > e} < exp -a 



In 2 



Note that D T can have rank a at most; we will assume a > 1 from now, and 
will see at the end that the case where a would be zero is trivial. Putting this 

13 



and eq. together with the union bound, this gives us 
Pr|37r,T from the 77-net |Tre(7r)-D T — Tr7rr| > 77 j 

'5\ 4S / l-ln2 




cd 
2 



161n2 

which is smaller than 1 by our choice of parameters: rj = A/8, e = (rj/2) 2 , 
That is, except when S = (which is implied by a = 0), 

□ 



d 2 (A/100) 4 



41og(100/A) 

in which case the proposition is trivial 



Theorem 19 Q&t,(id C 2) = Q vv (idc 2 ) = 2, and in fact the strong converse 
holds: for all < A < 1, 

lim — log Lbv(n, A) = lim — log L vv (n, A) = 2. 

n — >oo Jl n — >oo n 

Proof. This follows immediately by invoking proposition 1171 for the direct 
part on d = 2 n dimensions, and remark 1161 for the converse. Looking at 
proposition II 51 and the strong converse for N(n, Ai, A2) (proposition |HJ yields 
the strong converse. □ 



Corollary 20 For every channel T, 
Qbv(T) > 2Q bv {T), 



Qw(T) > 2Q vV (T). 



Proof. We begin by observing that by linearity both blind and visible encoders 
can be understood as mappings S(M) — > S(Af n ). The claim follows by 
understanding that one can concatenate a given bV- or vV-code with the 
construction of proposition El this translates a given pure state on M! into 
a mixed state on M. which is then processed further by the encoder to a state 
on Af n ; it translates a fidelity-test on A4' into a binary POVM on Ai which 
the visible decoder turns into a POVM on Af n . □ 

Let us summarise what we can say about two special channels, idc2 (a 
noiseless qubit) and idj (the completely dephasing channel, i.e. a noiseless 
classical bit): 



T = id C 2 : 


x\y 


b 


V 


V 


T = id^: 


x\y 


b 


V 


V 


£7=1 


Qxy b 


1 


1 


2 


£7 = 1 


Qxy < b 








1 


£7id = 2 


V 


1 


1 


2 


Cm = 1 


V 








1 



4 Conclusions 

Consideration of identification problems via quantum channels has yielded 
surprising results: we proved that the unrestricted identification capacity can 
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be larger than the transmission capacity by explicitly computing it for hybrid 
quantum memories. These findings are in marked contrast to the classical 
situation where the identification capacity shows a strong connection to the 
capacity CR(T) of the channel to create common randomness between sender 
and receiver Ml^l (It is possible to find classical channels with Cyd(T) ^ CR(T) 
if one allows exponentially growing alphabets^ or memory 1 ^) Indeed, using 
the Holevo information bound^l it is not difficult to show that the common 
randomness capacity CR(T) is at most C(T); since transmission can always 
be used to set up correlation, the other inequality is trivial, so CR(T) = C(T) 
(see also a forthcoming work by the author). 

This increase (by a factor of 2 for the noiseless qubit channel) may be 
connected to dense coding, i.e., entanglement-assisted transmissionPH If so, 
we would learn that the technical condition in Lober's theorem |2| on alphabet 
size is necessary; also, that something with the conjecture in remark 1141 is 
wrong — note that entanglement increases the cap acity of typical qc-channcls 
(e.g. from 0.585 to 1 for the trine measurement! 1 ^ , and that ID coding for qc- 
channcls is automatically simultaneous due to the built-in measurement. If 
not related to dense coding, there is another possible, geometric, explanation 
for the effect: it may ultimately have to do with the geometry of state space, 
where the dimension of the manifold of mixed states is roughly the square of 
the dimension of the pure state manifold. 

Then we went further and considered identification of quantum states: 
also there we showed that the capacity can exceed the quantum transmission 
capacity, and the same factor 2 occurs for the noiseless qubit. Our construc- 
tion may well be of independent interest, as it essentially consists of a "random 
noisy channel" 

: B(C S ) V ^ B(C ® C u ) ^ B(C*), 

with an (isotropic) random isometry V and s < tu. E.g. for s = 1 we recover 
the generation of random mixed states by partial trace from a random pure 
state (for a recent work see Zyczkowski and SommerJ^) — one can rewrite 
our construction of a classical ID code (proposition [BJ) in these terms, with 
t = O(ulogu); for u = 1 we are selecting a random embedding which is used 
in proofs of the quantum channel coding theorem. 

We had to leave many open questions: is C S j m _iD ^ Cm and what is 
Cid (T) in general? Could it be equal to the lower bound given in theoremll2f 
What is Qm(T) in general — in particular, can it be larger than 2Q(T)1 We 
conjecture that one needs quantum transmission to have Qm{T) ^ and in 
particular that Qio(id^) = Qw{^a) = max{21og<ii : 1 < i < r} for the 
hybrid quantum memory A — B(C di ). Note that this would also imply 

that (bounded) common randomness between the users does not increase the 
quantum identification capacities, unlike the classical case^ (see remark ITT|) : 
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this is certainly true for Qbb (and likely for Q v b, see Bennett et al.lP* Theorem 
24 (Appendix C) for combinations of noiseless quantum and classical channels) 
because we can phrase the error condition as an average pure state fidelity, 
and this can always be achieved without randomisation. Similar questions 
arise in connection with the other visible coding capacities that we defined: 
for example, it seems that visible encoding and bounded capacity classical 
side channels cannot increase our quantum capacities (unbounded classical 
side channels will certainly trivialise the visible encoder variants) . How could 
one prove this in general? A possible strategy that would solve this and relates 

to the above question about Qm{T) = 2Q{T) could be the construction of 
a simulation of T for visible transmission, using Q(T) qubits and some finite 
rate of classical bits. Entanglement binding channels however might pose 
a difficulty here (note that they are the reason that a simulation can only 
possibly exist for visibly given input states). 

The role of the general visible decoding ("V") is not yet very clear, but 
extremely challenging: we conjecture that this model is as hard as usual trans- 
mission but it seems one needs methods beyond those of Schumacher et al^^ 
to prove any bound. One such approach could be based on a further investiga- 
tion of the state space geometric relations in ID coding: to begin with, classical 
identification is an almost-isometry (under statistical distance) from the ver- 
tices of an 0(2 d )-probability simplex into a d-probability simplex; quantum 
fingerprinting maps similarly into the pure states on C d (with trace distance). 
Our quantum-ID codes for the noiseless channel are almost-isometries (un- 
der trace distance) from the set of pure states on C°( d ' into the mixed states 
on C d ; note that here even the dimensions of the two state spaces as mani- 
folds are comparable! In fact, what is more important, they have comparable 
Vapnip-Chervonenkis dimension (also called covering dimension). For the 
V- and b-variants of decoders stricter metric constraints apply, extending to 
larger groups of states, not only pairwise distances, which accounts for the 
capacities being smaller in these models. 
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